Overview

Dataset statistics

Number of variables: 21
Number of observations: 2147
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 1
Duplicate rows (%): < 0.1%
Total size in memory: 515.8 KiB
Average record size in memory: 246.0 B
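
The derived figures in this table can be checked from the raw counts. The sketch below is only an arithmetic cross-check, assuming that 1 KiB is 1024 bytes and that the average record size is the total size divided by the number of observations; the variable names are illustrative and not part of the report.

```python
# Figures copied from the dataset statistics above.
n_rows = 2147
duplicate_rows = 1
total_kib = 515.8

# Share of duplicate rows, as a percentage (reported as "< 0.1%").
duplicate_pct = 100 * duplicate_rows / n_rows

# Average bytes per record, assuming 1 KiB = 1024 bytes.
avg_record_bytes = total_kib * 1024 / n_rows

print(f"duplicate rows: {duplicate_pct:.3f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions the result rounds to the reported 246.0 B.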

Variable types

NUM: 20
CAT: 1

Reproduction

Analysis started: 2020-05-02 15:00:41.952062
Analysis finished: 2020-05-02 15:02:04.036193
Version: pandas-profiling v2.5.0
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configuration: config.yaml
Dataset has 1 (< 0.1%) duplicate rows [Duplicates]
kurt is highly correlated with skew [High Correlation]
skew is highly correlated with kurt [High Correlation]
centroid is highly correlated with meanfreq [High Correlation]
meanfreq is highly correlated with centroid [High Correlation]
dfrange is highly correlated with maxdom [High Correlation]
maxdom is highly correlated with dfrange [High Correlation]
mode has 57 (2.7%) zeros [Zeros]
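
As a rough illustration of how the duplicate and zero counts behind such warnings can be obtained, here is a minimal pure-Python sketch. It is not the report generator's own code: the helper names are hypothetical, the toy rows are made up, and counting only the second and later copies of a row as duplicates is an assumption.

```python
from collections import Counter

def duplicate_row_count(rows):
    """Count rows that repeat an earlier row (assumed definition of a duplicate)."""
    counts = Counter(tuple(row) for row in rows)
    return sum(c - 1 for c in counts.values() if c > 1)

def zero_share(values):
    """Return the number of zeros and their percentage of all values."""
    zeros = sum(1 for v in values if v == 0)
    return zeros, 100 * zeros / len(values)

# Toy rows (hypothetical, not taken from the dataset).
rows = [(0.1, 0.0), (0.2, 0.3), (0.1, 0.0), (0.4, 0.5)]
print(duplicate_row_count(rows))
print(zero_share([row[1] for row in rows]))

# Cross-check of the reported figure: 57 zeros among 2147 observations.
mode_zero_pct = round(100 * 57 / 2147, 1)
print(mode_zero_pct)
```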

Variables

meanfreq
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 2146
Unique (%): > 99.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.1870594791
Minimum: 0.1125746087
Maximum: 0.2511237587
Zeros: 0
Zeros (%): 0.0%
Memory size: 16.9 KiB

Quantile statistics

Minimum: 0.1125746087
5-th percentile: 0.1399722295
Q1: 0.1748685002
median: 0.1886386787
Q3: 0.2011942643
95-th percentile: 0.2305188825
Maximum: 0.2511237587
Range: 0.13854915
Interquartile range (IQR): 0.02632576411

Descriptive statistics

Standard deviation: 0.0249000491
Coefficient of variation (CV): 0.1331130036
Kurtosis: 0.1658105411
Mean: 0.1870594791
Median Absolute Deviation (MAD): 0.0188486892
Skewness: -0.3028387724
Sum: 401.6167016
Variance: 0.0006200124454
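
Several of these statistics follow from the others. The sketch below cross-checks the meanfreq figures using the standard definitions (range = maximum - minimum, IQR = Q3 - Q1, CV = standard deviation / mean, variance = standard deviation squared, sum = mean × number of observations). It is only a consistency check on the printed values, and the tolerance is an arbitrary choice.

```python
import math

# Values copied from the meanfreq statistics above.
n = 2147
mean, std = 0.1870594791, 0.0249000491
minimum, maximum = 0.1125746087, 0.2511237587
q1, q3 = 0.1748685002, 0.2011942643

derived = {
    "Range": maximum - minimum,
    "IQR": q3 - q1,
    "CV": std / mean,
    "Variance": std ** 2,
    "Sum": mean * n,
}
reported = {
    "Range": 0.13854915,
    "IQR": 0.02632576411,
    "CV": 0.1331130036,
    "Variance": 0.0006200124454,
    "Sum": 401.6167016,
}

# Compare each derived value with the reported one, allowing for rounding.
for name, value in derived.items():
    matches = math.isclose(value, reported[name], rel_tol=1e-6)
    print(f"{name}: {value:.10g} (matches report: {matches})")
```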
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.11257461 0.12490613 0.14097202 0.16540393 0.17621094 0.20144825 0.21234939 0.23706762 0.24381429 0.25112376], "bayesian blocks" binning strategy used)
Value  Count  Frequency (%)
0.2137323717 2 0.1%
 
0.2289027479 1 < 0.1%
 
0.1410780758 1 < 0.1%
 
0.1919592257 1 < 0.1%
 
0.2120307575 1 < 0.1%
 
0.2137375189 1 < 0.1%
 
0.2038652175 1 < 0.1%
 
0.1863085001 1 < 0.1%
 
0.1763834864 1 < 0.1%
 
0.1665615201 1 < 0.1%
 
Other values (2136) 2136 99.5%
 
Value  Count  Frequency (%)
0.1125746087 1 < 0.1%
 
0.112687253 1 < 0.1%
 
0.1137637158 1 < 0.1%
 
0.1139983213 1 < 0.1%
 
0.1173230883 1 < 0.1%
 
Value  Count  Frequency (%)
0.2511237587 1 < 0.1%
 
0.2470406841 1 < 0.1%
 
0.2443564469 1 < 0.1%
 
0.2432721416 1 < 0.1%
 
0.2432663032 1 < 0.1%
 

sd
Real number (ℝ≥0)

Distinct count: 2146
Unique (%): > 99.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.05530093694
Minimum: 0.02178199064
Maximum: 0.09579842628
Zeros: 0
Zeros (%): 0.0%
Memory size: 16.9 KiB

Quantile statistics

Minimum: 0.02178199064
5-th percentile: 0.03180802534
Q1: 0.04082656847
median: 0.05831198978
Q3: 0.06350557289
95-th percentile: 0.08266786103
Maximum: 0.09579842628
Range: 0.07401643564
Interquartile range (IQR): 0.02267900441

Descriptive statistics

Standard deviation: 0.015375466
Coefficient of variation (CV): 0.2780326492
Kurtosis: -0.6619897957
Mean: 0.05530093694
Median Absolute Deviation (MAD): 0.01259476389
Skewness: 0.08051377599
Sum: 118.7311116
Variance: 0.0002364049548
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.02178199 0.02716882 0.03067919 0.03861014 0.04602112 ... 0.05714759 0.06285018 0.06867393 0.08885061 0.09579843], "bayesian blocks" binning strategy used)
Value  Count  Frequency (%)
0.05770473882 2 0.1%
 
0.06212247431 1 < 0.1%
 
0.05804067186 1 < 0.1%
 
0.04080598623 1 < 0.1%
 
0.04357460227 1 < 0.1%
 
0.06248637126 1 < 0.1%
 
0.06089676665 1 < 0.1%
 
0.07472792323 1 < 0.1%
 
0.05512079056 1 < 0.1%
 
0.06745093931 1 < 0.1%
 
Other values (2136) 2136 99.5%
 
Value  Count  Frequency (%)
0.02178199064 1 < 0.1%
 
0.02506826557 1 < 0.1%
 
0.02550020574 1 < 0.1%
 
0.02551958613 1 < 0.1%
 
0.0259730293 1 < 0.1%
 
Value  Count  Frequency (%)
0.09579842628 1 < 0.1%
 
0.09563517323 1 < 0.1%
 
0.09492076645 1 < 0.1%
 
0.09289041687 1 < 0.1%
 
0.09288353691 1 < 0.1%
 

median
Real number (ℝ≥0)

Distinct count: 2091
Unique (%): 97.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.1932625175
Minimum: 0.1082965932
Maximum: 0.2612244898
Zeros: 0
Zeros (%): 0.0%
Memory size: 16.9 KiB

Quantile statistics

Minimum: 0.1082965932
5-th percentile: 0.1326622714
Q1: 0.1778258628
median: 0.1944927536
Q3: 0.2139523559
95-th percentile: 0.2364824051
Maximum: 0.2612244898
Range: 0.1529278966
Interquartile range (IQR): 0.03612649317

Descriptive statistics

Standard deviation: 0.02919244108
Coefficient of variation (CV): 0.1510507131
Kurtosis: 0.2394031112
Mean: 0.1932625175
Median Absolute Deviation (MAD): 0.02270430619
Skewness: -0.5699368657
Sum: 414.934625
Variance: 0.000852198616
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.10829659 0.15986231 0.17429314 0.21804591 0.23697434 0.24539955 0.26122449], "bayesian blocks" binning strategy used)
Value  Count  Frequency (%)
0.1834482759 3 0.1%
 
0.1720315582 3 0.1%
 
0.22 3 0.1%
 
0.1866666667 3 0.1%
 
0.2041325536 3 0.1%
 
0.1865123967 2 0.1%
 
0.1889391304 2 0.1%
 
0.2134170153 2 0.1%
 
0.1937456243 2 0.1%
 
0.1891027732 2 0.1%
 
Other values (2081) 2122 98.8%
 
Value  Count  Frequency (%)
0.1082965932 1 < 0.1%
 
0.1083510638 1 < 0.1%
 
0.1094781987 1 < 0.1%
 
0.1097457627 1 < 0.1%
 
0.1098294574 1 < 0.1%
 
Value  Count  Frequency (%)
0.2612244898 1 < 0.1%
 
0.2574170495 1 < 0.1%
 
0.2569835597 1 < 0.1%
 
0.2566312595 1 < 0.1%
 
0.2564019449 1 < 0.1%
 

Q25
Real number (ℝ≥0)

Distinct count: 2110
Unique (%): 98.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.1474185559
Minimum: 0.01993527508
Maximum: 0.2473469388
Zeros: 0
Zeros (%): 0.0%
Memory size: 16.9 KiB

Quantile statistics

Minimum: 0.01993527508
5-th percentile: 0.06765908522
Q1: 0.1244825373
median: 0.1430453698
Q3: 0.1774368336
95-th percentile: 0.2160887894
Maximum: 0.2473469388
Range: 0.2274116637
Interquartile range (IQR): 0.05295429621

Descriptive statistics

Standard deviation: 0.04201619713
Coefficient of variation (CV): 0.2850129475
Kurtosis: -0.06812888722
Mean: 0.1474185559
Median Absolute Deviation (MAD): 0.03365725413
Skewness: -0.2905854842
Sum: 316.5076396
Variance: 0.001765360822
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.01993528 0.0417661 0.0862824 0.09556481 0.12448254 ... 0.16404658 0.18449639 0.2055028 0.22943719 0.24734694], "bayesian blocks" binning strategy used)
Value  Count  Frequency (%)
0.14 5 0.2%
 
0.2036363636 3 0.1%
 
0.1716129032 3 0.1%
 
0.1686902928 2 0.1%
 
0.1365153734 2 0.1%
 
0.1917433752 2 0.1%
 
0.2237254902 2 0.1%
 
0.105 2 0.1%
 
0.2235820896 2 0.1%
 
0.175 2 0.1%
 
Other values (2100) 2122 98.8%
 
Value  Count  Frequency (%)
0.01993527508 1 < 0.1%
 
0.02283261803 1 < 0.1%
 
0.02401715511 1 < 0.1%
 
0.02622516556 1 < 0.1%
 
0.02802669209 1 < 0.1%
 
Value  Count  Frequency (%)
0.2473469388 1 < 0.1%
 
0.2421235324 1 < 0.1%
 
0.240735194 1 < 0.1%
 
0.2405416249 1 < 0.1%
 
0.2385829708 1 < 0.1%
 

Q75
Real number (ℝ≥0)

Distinct count: 2069
Unique (%): 96.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.2296628411
Minimum: 0.1631165118
Maximum: 0.2734693878
Zeros: 0
Zeros (%): 0.0%
Memory size: 16.9 KiB

Quantile statistics

Minimum: 0.1631165118
5-th percentile: 0.1966532536
Q1: 0.2139680316
median: 0.2305333333
Q3: 0.2465191018
95-th percentile: 0.2583990519
Maximum: 0.2734693878
Range: 0.110352876
Interquartile range (IQR): 0.03255107016

Descriptive statistics

Standard deviation: 0.02017126131
Coefficient of variation (CV): 0.08782988669
Kurtosis: -0.6927704264
Mean: 0.2296628411
Median Absolute Deviation (MAD): 0.01710888942
Skewness: -0.2743823662
Sum: 493.0861199
Variance: 0.0004068797829
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.16311651 0.18520374 0.19471775 0.20741995 0.20746213 0.25906095 0.26676175 0.27346939], "bayesian blocks" binning strategy used)
Value  Count  Frequency (%)
0.2333333333 4 0.2%
 
0.2413793103 4 0.2%
 
0.245 4 0.2%
 
0.2418181818 3 0.1%
 
0.2549253731 3 0.1%
 
0.224 3 0.1%
 
0.2499570815 2 0.1%
 
0.2385514612 2 0.1%
 
0.2077659574 2 0.1%
 
0.2140622372 2 0.1%
 
Other values (2059) 2118 98.6%
 
Value  Count  Frequency (%)
0.1631165118 1 < 0.1%
 
0.1657439896 1 < 0.1%
 
0.1681334392 1 < 0.1%
 
0.1693767867 1 < 0.1%
 
0.1694839609 1 < 0.1%
 
Value  Count  Frequency (%)
0.2734693878 1 < 0.1%
 
0.2698517298 1 < 0.1%
 
0.2689373297 1 < 0.1%
 
0.2689240506 1 < 0.1%
 
0.2687919943 1 < 0.1%
 

IQR
Real number (ℝ≥0)

Distinct count: 2084
Unique (%): 97.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.08224428519
Minimum: 0.01658674189
Maximum: 0.195526658
Zeros: 0
Zeros (%): 0.0%
Memory size: 16.9 KiB

Quantile statistics

Minimum: 0.01658674189
5-th percentile: 0.02711022704
Q1: 0.04403782563
median: 0.09107526882
Q3: 0.1123675488
95-th percentile: 0.1441319139
Maximum: 0.195526658
Range: 0.1789399161
Interquartile range (IQR): 0.06832972312

Descriptive statistics

Standard deviation: 0.03899081434
Coefficient of variation (CV): 0.4740853938
Kurtosis: -0.8970459927
Mean: 0.08224428519
Median Absolute Deviation (MAD): 0.03445675249
Skewness: 0.1502248831
Sum: 176.5784803
Variance: 0.001520283603
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.01658674 0.02030467 0.02533616 0.05246359 0.06350478 ... 0.10077909 0.1206064 0.12846758 0.18203592 0.19552666], "bayesian blocks" binning strategy used)
Value  Count  Frequency (%)
0.035 4 0.2%
 
0.105 4 0.2%
 
0.04666666667 3 0.1%
 
0.04307692308 3 0.1%
 
0.1184615385 2 0.1%
 
0.1119311193 2 0.1%
 
0.04433333333 2 0.1%
 
0.1091176471 2 0.1%
 
0.1067150635 2 0.1%
 
0.1017755444 2 0.1%
 
Other values (2074) 2121 98.8%
 
Value  Count  Frequency (%)
0.01658674189 1 < 0.1%
 
0.01742895805 1 < 0.1%
 
0.01800643087 1 < 0.1%
 
0.01817218543 1 < 0.1%
 
0.01858541893 1 < 0.1%
 
Value  Count  Frequency (%)
0.195526658 1 < 0.1%
 
0.1909363831 1 < 0.1%
 
0.1846098868 1 < 0.1%
 
0.1825903614 1 < 0.1%
 
0.1814814815 1 < 0.1%
 

skew
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 2146
Unique (%): > 99.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.073514085
Minimum: 0.2850202853
Maximum: 4.316058818
Zeros: 0
Zeros (%): 0.0%
Memory size: 16.9 KiB

Quantile statistics

Minimum: 0.2850202853
5-th percentile: 1.090451625
Q1: 1.55429069
median: 2.00091713
Q3: 2.531818639
95-th percentile: 3.347653841
Maximum: 4.316058818
Range: 4.031038532
Interquartile range (IQR): 0.9775279492

Descriptive statistics

Standard deviation: 0.6895606485
Coefficient of variation (CV): 0.3325565298
Kurtosis: -0.2283313113
Mean: 2.073514085
Median Absolute Deviation (MAD): 0.5601325066
Skewness: 0.4661334878
Sum: 4451.834741
Variance: 0.475493888
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.28502029 0.69149892 0.90823648 1.08766507 1.31926702 ... 2.68051138 3.13681312 3.62239673 4.00677045 4.31605882], "bayesian blocks" binning strategy used)
Value  Count  Frequency (%)
2.113598278 2 0.1%
 
2.163847944 1 < 0.1%
 
3.058550761 1 < 0.1%
 
2.069733704 1 < 0.1%
 
3.371422074 1 < 0.1%
 
2.068888018 1 < 0.1%
 
2.001373324 1 < 0.1%
 
4.065658329 1 < 0.1%
 
3.896396746 1 < 0.1%
 
2.073218316 1 < 0.1%
 
Other values (2136) 2136 99.5%
 
Value  Count  Frequency (%)
0.2850202853 1 < 0.1%
 
0.5487427053 1 < 0.1%
 
0.5897874639 1 < 0.1%
 
0.608051472 1 < 0.1%
 
0.6083710438 1 < 0.1%
 
Value  Count  Frequency (%)
4.316058818 1 < 0.1%
 
4.281047254 1 < 0.1%
 
4.154631288 1 < 0.1%
 
4.147547098 1 < 0.1%
 
4.112595664 1 < 0.1%
 

kurt
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 2146
Unique (%): > 99.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 8.479436401
Minimum: 2.293368
Maximum: 25.59206834
Zeros: 0
Zeros (%): 0.0%
Memory size: 16.9 KiB

Quantile statistics

Minimum: 2.293368
5-th percentile: 3.589617387
Q1: 5.236555608
median: 7.308962828
Q3: 10.49323056
95-th percentile: 17.60373782
Maximum: 25.59206834
Range: 23.29870034
Interquartile range (IQR): 5.25667495

Descriptive statistics

Standard deviation: 4.403373771
Coefficient of variation (CV): 0.5193002886
Kurtosis: 1.676492459
Mean: 8.479436401
Median Absolute Deviation (MAD): 3.403511175
Skewness: 1.321712021
Sum: 18205.34995
Variance: 19.38970056
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 2.293368 2.61528917 3.35172491 3.9925164 6.9236621 ... 10.49323056 14.7390388 16.16342836 20.12135027 25.59206834], "bayesian blocks" binning strategy used)
Value  Count  Frequency (%)
7.890926985 2 0.1%
 
3.820472506 1 < 0.1%
 
14.45188832 1 < 0.1%
 
8.115036954 1 < 0.1%
 
9.781177928 1 < 0.1%
 
3.693392976 1 < 0.1%
 
5.199408012 1 < 0.1%
 
4.531100471 1 < 0.1%
 
11.01607587 1 < 0.1%
 
6.398563412 1 < 0.1%
 
Other values (2136) 2136 99.5%
 
Value  Count  Frequency (%)
2.293368 1 < 0.1%
 
2.46256132 1 < 0.1%
 
2.527880398 1 < 0.1%
 
2.603362101 1 < 0.1%
 
2.627216238 1 < 0.1%
 
Value  Count  Frequency (%)
25.59206834 1 < 0.1%
 
25.42744044 1 < 0.1%
 
25.38702215 1 < 0.1%
 
25.32262443 1 < 0.1%
 
25.22221741 1 < 0.1%
 

sp.ent
Real number (ℝ≥0)

Distinct count: 2146
Unique (%): > 99.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.896081886
Minimum: 0.7711673323
Maximum: 0.9765329702
Zeros: 0
Zeros (%): 0.0%
Memory size: 16.9 KiB

Quantile statistics

Minimum: 0.7711673323
5-th percentile: 0.8203984316
Q1: 0.8646363488
median: 0.9016863291
Q3: 0.9266093325
95-th percentile: 0.9630603704
Maximum: 0.9765329702
Range: 0.2053656379
Interquartile range (IQR): 0.06197298372

Descriptive statistics

Standard deviation: 0.04259591666
Coefficient of variation (CV): 0.04753574123
Kurtosis: -0.4938296522
Mean: 0.896081886
Median Absolute Deviation (MAD): 0.03501713152
Skewness: -0.3637771111
Sum: 1923.887809
Variance: 0.001814412116
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.77116733 0.79946337 0.81671027 0.84305589 0.84305709 0.89060231 0.93530743 0.96840759 0.97653297], "bayesian blocks" binning strategy used)
Value  Count  Frequency (%)
0.8597123484 2 0.1%
 
0.8960124174 1 < 0.1%
 
0.9437210184 1 < 0.1%
 
0.9257820649 1 < 0.1%
 
0.8549195159 1 < 0.1%
 
0.7900474456 1 < 0.1%
 
0.8502610037 1 < 0.1%
 
0.9162869554 1 < 0.1%
 
0.8504492364 1 < 0.1%
 
0.8762574598 1 < 0.1%
 
Other values (2136) 2136 99.5%
 
Value  Count  Frequency (%)
0.7711673323 1 < 0.1%
 
0.7713014082 1 < 0.1%
 
0.7718593829 1 < 0.1%
 
0.7727412762 1 < 0.1%
 
0.7738310166 1 < 0.1%
 
Value  Count  Frequency (%)
0.9765329702 1 < 0.1%
 
0.9764626195 1 < 0.1%
 
0.976355461 1 < 0.1%
 
0.9758139234 1 < 0.1%
 
0.9753869363 1 < 0.1%
 

sfm
Real number (ℝ≥0)

Distinct count: 2146
Unique (%): > 99.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.3915937023
Minimum: 0.08220408541
Maximum: 0.8260991385
Zeros: 0
Zeros (%): 0.0%
Memory size: 16.9 KiB

Quantile statistics

Minimum: 0.08220408541
5-th percentile: 0.1584448722
Q1: 0.2491830445
median: 0.361472534
Q3: 0.5092714429
95-th percentile: 0.7283540146
Maximum: 0.8260991385
Range: 0.7438950531
Interquartile range (IQR): 0.2600883983

Descriptive statistics

Standard deviation: 0.1741391803
Coefficient of variation (CV): 0.4446935159
Kurtosis: -0.6326895567
Mean: 0.3915937023
Median Absolute Deviation (MAD): 0.1456743056
Skewness: 0.5193885095
Sum: 840.7516789
Variance: 0.03032445411
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.08220409 0.11809323 0.16981495 0.32469214 0.5401082 0.78752155 0.82609914], "bayesian blocks" binning strategy used)
Value  Count  Frequency (%)
0.08493436355 2 0.1%
 
0.5809385427 1 < 0.1%
 
0.2345334848 1 < 0.1%
 
0.1969930127 1 < 0.1%
 
0.1792393576 1 < 0.1%
 
0.5597423183 1 < 0.1%
 
0.1701727166 1 < 0.1%
 
0.5604066953 1 < 0.1%
 
0.3876420051 1 < 0.1%
 
0.4375912436 1 < 0.1%
 
Other values (2136) 2136 99.5%
 
Value  Count  Frequency (%)
0.08220408541 1 < 0.1%
 
0.08493436355 2 0.1%
 
0.09335855486 1 < 0.1%
 
0.09445510179 1 < 0.1%
 
0.09467554002 1 < 0.1%
 
Value  Count  Frequency (%)
0.8260991385 1 < 0.1%
 
0.8226706545 1 < 0.1%
 
0.8225866361 1 < 0.1%
 
0.8180562992 1 < 0.1%
 
0.8130875954 1 < 0.1%
 

mode
Real number (ℝ≥0)

ZEROS
Distinct count: 2045
Unique (%): 95.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.1847511453
Minimum: 0
Maximum: 0.28
Zeros: 57
Zeros (%): 2.7%
Memory size: 16.9 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0.04999161777
Q1: 0.1598844002
median: 0.1969686985
Q3: 0.2300726819
95-th percentile: 0.2650716534
Maximum: 0.28
Range: 0.28
Interquartile range (IQR): 0.07018828171

Descriptive statistics

Standard deviation: 0.06386627131
Coefficient of variation (CV): 0.3456880941
Kurtosis: 1.036461832
Mean: 0.1847511453
Median Absolute Deviation (MAD): 0.04832893441
Skewness: -1.115941192
Sum: 396.660709
Variance: 0.004078900612
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.00152807 0.02214741 0.0498901 0.05026062 ... 0.18925633 0.20797918 0.26160889 0.27985169 0.28 ], "bayesian blocks" binning strategy used)
Value  Count  Frequency (%)
0 57 2.7%
 
0.28 9 0.4%
 
0.1866666667 5 0.2%
 
0.05013333333 3 0.1%
 
0.245 3 0.1%
 
0.239762963 2 0.1%
 
0.1938461538 2 0.1%
 
0.04999161777 2 0.1%
 
0.210097629 2 0.1%
 
0.1082411436 2 0.1%
 
Other values (2035) 2060 95.9%
 
Value  Count  Frequency (%)
0 57 2.7%
 
0.003056133056 1 < 0.1%
 
0.004603288063 1 < 0.1%
 
0.004875621891 1 < 0.1%
 
0.005390898483 1 < 0.1%
 
Value  Count  Frequency (%)
0.28 9 0.4%
 
0.2797033898 1 < 0.1%
 
0.2795851852 1 < 0.1%
 
0.2795229983 1 < 0.1%
 
0.2791181102 1 < 0.1%
 

centroid
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 2146
Unique (%): > 99.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.1870594791
Minimum: 0.1125746087
Maximum: 0.2511237587
Zeros: 0
Zeros (%): 0.0%
Memory size: 16.9 KiB

Quantile statistics

Minimum: 0.1125746087
5-th percentile: 0.1399722295
Q1: 0.1748685002
median: 0.1886386787
Q3: 0.2011942643
95-th percentile: 0.2305188825
Maximum: 0.2511237587
Range: 0.13854915
Interquartile range (IQR): 0.02632576411

Descriptive statistics

Standard deviation: 0.0249000491
Coefficient of variation (CV): 0.1331130036
Kurtosis: 0.1658105411
Mean: 0.1870594791
Median Absolute Deviation (MAD): 0.0188486892
Skewness: -0.3028387724
Sum: 401.6167016
Variance: 0.0006200124454
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.11257461 0.12490613 0.14097202 0.16540393 0.17621094 0.20144825 0.21234939 0.23706762 0.24381429 0.25112376], "bayesian blocks" binning strategy used)
Value  Count  Frequency (%)
0.2137323717 2 0.1%
 
0.2289027479 1 < 0.1%
 
0.1410780758 1 < 0.1%
 
0.1919592257 1 < 0.1%
 
0.2120307575 1 < 0.1%
 
0.2137375189 1 < 0.1%
 
0.2038652175 1 < 0.1%
 
0.1863085001 1 < 0.1%
 
0.1763834864 1 < 0.1%
 
0.1665615201 1 < 0.1%
 
Other values (2136) 2136 99.5%
 
Value  Count  Frequency (%)
0.1125746087 1 < 0.1%
 
0.112687253 1 < 0.1%
 
0.1137637158 1 < 0.1%
 
0.1139983213 1 < 0.1%
 
0.1173230883 1 < 0.1%
 
Value  Count  Frequency (%)
0.2511237587 1 < 0.1%
 
0.2470406841 1 < 0.1%
 
0.2443564469 1 < 0.1%
 
0.2432721416 1 < 0.1%
 
0.2432663032 1 < 0.1%
 

meanfun
Real number (ℝ≥0)

Distinct count: 2146
Unique (%): > 99.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.1453491138
Minimum: 0.07116814457
Maximum: 0.2257554714
Zeros: 0
Zeros (%): 0.0%
Memory size: 16.9 KiB

Quantile statistics

Minimum: 0.07116814457
5-th percentile: 0.1017074817
Q1: 0.1227695399
median: 0.1414446349
Q3: 0.1698690581
95-th percentile: 0.1915817037
Maximum: 0.2257554714
Range: 0.1545873269
Interquartile range (IQR): 0.04709951823

Descriptive statistics

Standard deviation: 0.02891874278
Coefficient of variation (CV): 0.1989605717
Kurtosis: -0.9345505894
Mean: 0.1453491138
Median Absolute Deviation (MAD): 0.02516219264
Skewness: 0.1100841701
Sum: 312.0645474
Variance: 0.0008362936841
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.07116814 0.09050164 0.10163275 0.10871818 0.12375133 ... 0.18294314 0.19388853 0.20181843 0.21339336 0.22575547], "bayesian blocks" binning strategy used)
Value  Count  Frequency (%)
0.1336673026 2 0.1%
 
0.1650750785 1 < 0.1%
 
0.1571508312 1 < 0.1%
 
0.1682575029 1 < 0.1%
 
0.1727898711 1 < 0.1%
 
0.1013949163 1 < 0.1%
 
0.1137630326 1 < 0.1%
 
0.1418104062 1 < 0.1%
 
0.1341726856 1 < 0.1%
 
0.1749417791 1 < 0.1%
 
Other values (2136) 2136 99.5%
 
Value  Count  Frequency (%)
0.07116814457 1 < 0.1%
 
0.07173922044 1 < 0.1%
 
0.07877827652 1 < 0.1%
 
0.08045982686 1 < 0.1%
 
0.08061413048 1 < 0.1%
 
Value  Count  Frequency (%)
0.2257554714 1 < 0.1%
 
0.2165806582 1 < 0.1%
 
0.2162084628 1 < 0.1%
 
0.2137212584 1 < 0.1%
 
0.2130654584 1 < 0.1%
 

minfun
Real number (ℝ≥0)

Distinct count: 621
Unique (%): 28.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.03834713862
Minimum: 0.009775171065
Maximum: 0.09142857143
Zeros: 0
Zeros (%): 0.0%
Memory size: 16.9 KiB

Quantile statistics

Minimum: 0.009775171065
5-th percentile: 0.01597284951
Q1: 0.02043436065
median: 0.04710500491
Q3: 0.04809619238
95-th percentile: 0.05206073753
Maximum: 0.09142857143
Range: 0.08165340036
Interquartile range (IQR): 0.02766183173

Descriptive statistics

Standard deviation: 0.01468286113
Coefficient of variation (CV): 0.3828932656
Kurtosis: -0.9042928865
Mean: 0.03834713862
Median Absolute Deviation (MAD): 0.01328316812
Skewness: -0.3633667957
Sum: 82.33130661
Variance: 0.0002155864109
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00977517 0.0154654 0.01564793 0.01685098 0.01950984 ... 0.05013056 0.05189195 0.05514074 0.07067932 0.09142857], "bayesian blocks" binning strategy used)
Value  Count  Frequency (%)
0.04692082111 59 2.7%
 
0.04710500491 51 2.4%
 
0.0469667319 47 2.2%
 
0.04701273262 46 2.1%
 
0.04705882353 42 2.0%
 
0.0473840079 39 1.8%
 
0.04715127701 37 1.7%
 
0.04729064039 35 1.6%
 
0.04719764012 32 1.5%
 
0.04743083004 29 1.4%
 
Other values (611) 1730 80.6%
 
Value  Count  Frequency (%)
0.009775171065 1 < 0.1%
 
0.009900990099 1 < 0.1%
 
0.01107419712 1 < 0.1%
 
0.01302083333 1 < 0.1%
 
0.01481481481 1 < 0.1%
 
Value  Count  Frequency (%)
0.09142857143 1 < 0.1%
 
0.09039548023 1 < 0.1%
 
0.08556149733 1 < 0.1%
 
0.08474576271 1 < 0.1%
 
0.08376963351 1 < 0.1%
 

maxfun
Real number (ℝ≥0)

Distinct count: 44
Unique (%): 2.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.2686778589
Minimum: 0.2191780822
Maximum: 0.2791139241
Zeros: 0
Zeros (%): 0.0%
Memory size: 16.9 KiB

Quantile statistics

Minimum: 0.2191780822
5-th percentile: 0.233378933
Q1: 0.2666666667
median: 0.275862069
Q3: 0.2774566474
95-th percentile: 0.2790697674
Maximum: 0.2791139241
Range: 0.05993584186
Interquartile range (IQR): 0.01078998073

Descriptive statistics

Standard deviation: 0.01417242086
Coefficient of variation (CV): 0.0527487487
Kurtosis: 2.20030684
Mean: 0.2686778589
Median Absolute Deviation (MAD): 0.01069240507
Skewness: -1.741719291
Sum: 576.851363
Variance: 0.000200857513
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.21917808 0.22792208 0.2322211 0.2384506 0.240006 ... 0.27661028 0.27740757 0.27761721 0.27842377 0.27911392], "bayesian blocks" binning strategy used)
Value  Count  Frequency (%)
0.2790697674 477 22.2%
 
0.275862069 349 16.3%
 
0.2774566474 295 13.7%
 
0.2711864407 160 7.5%
 
0.2666666667 106 4.9%
 
0.2742857143 100 4.7%
 
0.262295082 90 4.2%
 
0.25 63 2.9%
 
0.2727272727 49 2.3%
 
0.2580645161 47 2.2%
 
Other values (34) 411 19.1%
 
Value  Count  Frequency (%)
0.2191780822 16 0.7%
 
0.2222222222 15 0.7%
 
0.2253521127 11 0.5%
 
0.2272727273 2 0.1%
 
0.2285714286 28 1.3%
 
Value  Count  Frequency (%)
0.2791139241 6 0.3%
 
0.2790697674 477 22.2%
 
0.2777777778 15 0.7%
 
0.2774566474 295 13.7%
 
0.2773584906 2 0.1%
 

meandom
Real number (ℝ≥0)

Distinct count: 2083
Unique (%): 97.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.9199502507
Minimum: 0.05684840426
Maximum: 2.284943182
Zeros: 0
Zeros (%): 0.0%
Memory size: 16.9 KiB

Quantile statistics

Minimum: 0.05684840426
5-th percentile: 0.1989071801
Q1: 0.56640625
median: 0.8884943182
Q3: 1.236617647
95-th percentile: 1.787269176
Maximum: 2.284943182
Range: 2.228094778
Interquartile range (IQR): 0.6702113971

Descriptive statistics

Standard deviation: 0.4716556733
Coefficient of variation (CV): 0.5126969344
Kurtosis: -0.4282040493
Mean: 0.9199502507
Median Absolute Deviation (MAD): 0.3856923876
Skewness: 0.3677127073
Sum: 1975.133188
Variance: 0.2224590741
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.0568484 0.17830087 0.20757212 0.45424332 1.16064453 1.44995404 1.69431323 2.00176554 2.28494318], "bayesian blocks" binning strategy used)
Value  Count  Frequency (%)
0.71875 4 0.2%
 
1.0078125 3 0.1%
 
1.372395833 3 0.1%
 
0.861328125 2 0.1%
 
0.673828125 2 0.1%
 
1.11328125 2 0.1%
 
0.7901785714 2 0.1%
 
0.7265625 2 0.1%
 
0.18359375 2 0.1%
 
1.044270833 2 0.1%
 
Other values (2073) 2123 98.9%
 
Value  Count  Frequency (%)
0.05684840426 1 < 0.1%
 
0.06527217742 1 < 0.1%
 
0.06883445946 1 < 0.1%
 
0.07832532051 1 < 0.1%
 
0.07861328125 1 < 0.1%
 
Value  Count  Frequency (%)
2.284943182 1 < 0.1%
 
2.27290483 1 < 0.1%
 
2.254488032 1 < 0.1%
 
2.253038194 1 < 0.1%
 
2.228816106 1 < 0.1%
 

mindom
Real number (ℝ≥0)

Distinct count: 39
Unique (%): 1.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.03610754559
Minimum: 0.0048828125
Maximum: 0.1640625
Zeros: 0
Zeros (%): 0.0%
Memory size: 16.9 KiB

Quantile statistics

Minimum: 0.0048828125
5-th percentile: 0.0078125
Q1: 0.0234375
median: 0.0234375
Q3: 0.0234375
95-th percentile: 0.15625
Maximum: 0.1640625
Range: 0.1591796875
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 0.04089028656
Coefficient of variation (CV): 1.132458213
Kurtosis: 3.357520245
Mean: 0.03610754559
Median Absolute Deviation (MAD): 0.02797173695
Skewness: 2.135527143
Sum: 77.52290039
Variance: 0.001672015535
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00488281 0.00634766 0.01123047 0.01513672 0.0185791 ... 0.14111328 0.14746094 0.15869141 0.16259766 0.1640625 ], "bayesian blocks" binning strategy used)
Value  Count  Frequency (%)
0.0234375 1190 55.4%
 
0.0078125 470 21.9%
 
0.1640625 79 3.7%
 
0.0546875 39 1.8%
 
0.140625 31 1.4%
 
0.15625 29 1.4%
 
0.09375 28 1.3%
 
0.015625 28 1.3%
 
0.0048828125 26 1.2%
 
0.03125 23 1.1%
 
Other values (29) 204 9.5%
 
Value  Count  Frequency (%)
0.0048828125 26 1.2%
 
0.0078125 470 21.9%
 
0.0146484375 1 < 0.1%
 
0.015625 28 1.3%
 
0.02153320313 9 0.4%
 
Value  Count  Frequency (%)
0.1640625 79 3.7%
 
0.1611328125 1 < 0.1%
 
0.15625 29 1.4%
 
0.1484375 15 0.7%
 
0.146484375 3 0.1%
 

maxdom
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 828
Unique (%): 38.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5.718784455
Minimum: 0.1328125
Maximum: 14.3203125
Zeros: 0
Zeros (%): 0.0%
Memory size: 16.9 KiB

Quantile statistics

Minimum: 0.1328125
5-th percentile: 0.5703125
Q1: 3.796875
median: 5.6953125
Q3: 7.96875
95-th percentile: 10.4765625
Maximum: 14.3203125
Range: 14.1875
Interquartile range (IQR): 4.171875

Descriptive statistics

Standard deviation: 2.931069457
Coefficient of variation (CV): 0.5125336478
Kurtosis: -0.5353127689
Mean: 5.718784455
Median Absolute Deviation (MAD): 2.366201609
Skewness: -0.005241449305
Sum: 12278.23022
Variance: 8.591168164
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0.1328125 0.22265625 0.91796875 2.5390625 3.47265625 ... 8.00390625 9.64453125 10.32421875 12.0703125 14.3203125 ], "bayesian blocks" binning strategy used)
Value  Count  Frequency (%)
5.15625 16 0.7%
 
7 15 0.7%
 
5.390625 11 0.5%
 
12.0234375 11 0.5%
 
6.4453125 10 0.5%
 
5.2734375 10 0.5%
 
4.3828125 9 0.4%
 
8.015625 9 0.4%
 
6.140625 9 0.4%
 
8.90625 9 0.4%
 
Other values (818) 2038 94.9%
 
Value  Count  Frequency (%)
0.1328125 1 < 0.1%
 
0.140625 1 < 0.1%
 
0.171875 1 < 0.1%
 
0.1796875 1 < 0.1%
 
0.1953125 2 0.1%
 
Value  Count  Frequency (%)
14.3203125 1 < 0.1%
 
14.109375 2 0.1%
 
13.265625 1 < 0.1%
 
12.75 1 < 0.1%
 
12.7265625 1 < 0.1%
 

dfrange
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 832
Unique (%): 38.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5.682676909
Minimum: 0.0703125
Maximum: 14.296875
Zeros: 0
Zeros (%): 0.0%
Memory size: 16.9 KiB

Quantile statistics

Minimum: 0.0703125
5-th percentile: 0.5546875
Q1: 3.75
median: 5.6484375
Q3: 7.9453125
95-th percentile: 10.44609375
Maximum: 14.296875
Range: 14.2265625
Interquartile range (IQR): 4.1953125

Descriptive statistics

Standard deviation: 2.932463695
Coefficient of variation (CV): 0.5160356188
Kurtosis: -0.533482164
Mean: 5.682676909
Median Absolute Deviation (MAD): 2.364690529
Skewness: -0.001676140835
Sum: 12200.70732
Variance: 8.599343325
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0.0703125 0.53515625 0.76855469 0.98046875 2.703125 ... 7.96875 9.45703125 10.20703125 12.03515625 14.296875 ], "bayesian blocks" binning strategy used)
Value  Count  Frequency (%)
5.1328125 15 0.7%
 
3.75 10 0.5%
 
5.0859375 9 0.4%
 
4.4765625 9 0.4%
 
8.8828125 9 0.4%
 
8.0859375 9 0.4%
 
5.15625 9 0.4%
 
12 9 0.4%
 
8.1328125 9 0.4%
 
7.9921875 9 0.4%
 
Other values (822) 2050 95.5%
 
Value  Count  Frequency (%)
0.0703125 4 0.2%
 
0.078125 1 < 0.1%
 
0.09375 1 < 0.1%
 
0.125 1 < 0.1%
 
0.1328125 1 < 0.1%
 
Value  Count  Frequency (%)
14.296875 1 < 0.1%
 
14.0859375 2 0.1%
 
13.2421875 1 < 0.1%
 
12.7265625 1 < 0.1%
 
12.5625 1 < 0.1%
 

modindx
Real number (ℝ≥0)

Distinct count: 2140
Unique (%): 99.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.146303408
Minimum: 0.01988135321
Maximum: 0.3703703704
Zeros: 0
Zeros (%): 0.0%
Memory size: 16.9 KiB

Quantile statistics

Minimum: 0.01988135321
5-th percentile: 0.06604905656
Q1: 0.09789981337
median: 0.1290789474
Q3: 0.1750854285
95-th percentile: 0.2929986012
Maximum: 0.3703703704
Range: 0.3504890172
Interquartile range (IQR): 0.07718561509

Descriptive statistics

Standard deviation: 0.06799654657
Coefficient of variation (CV): 0.4647639279
Kurtosis: 0.9691658323
Mean: 0.146303408
Median Absolute Deviation (MAD): 0.05222248852
Skewness: 1.146287076
Sum: 314.113417
Variance: 0.004623530345
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.01988135 0.04411442 0.05535429 0.07141691 0.0853234 ... 0.12885101 0.16658494 0.21359492 0.28836307 0.37037037], "bayesian blocks" binning strategy used)
Value  Count  Frequency (%)
0.1722488038 2 0.1%
 
0.1176470588 2 0.1%
 
0.1064935065 2 0.1%
 
0.2290513104 2 0.1%
 
0.06043956044 2 0.1%
 
0.1462053571 2 0.1%
 
0.1075268817 2 0.1%
 
0.1509009009 1 < 0.1%
 
0.3150266972 1 < 0.1%
 
0.170018622 1 < 0.1%
 
Other values (2130) 2130 99.2%
 
Value  Count  Frequency (%)
0.01988135321 1 < 0.1%
 
0.02164750958 1 < 0.1%
 
0.02248677249 1 < 0.1%
 
0.02419512195 1 < 0.1%
 
0.02922690132 1 < 0.1%
 
Value  Count  Frequency (%)
0.3703703704 1 < 0.1%
 
0.3692664348 1 < 0.1%
 
0.3691883373 1 < 0.1%
 
0.3626834382 1 < 0.1%
 
0.3609316892 1 < 0.1%
 

label
Categorical

Distinct count: 2
Unique (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 16.9 KiB
Value  Count  Frequency (%)
male 1081 50.3%
 
female 1066 49.7%
 

Length

Max length: 6
Mean length: 4.993013507
Min length: 4
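
The length statistics follow directly from the two label values and their counts. A minimal sketch of that arithmetic, assuming length means the number of characters in each label string:

```python
# Label counts copied from the frequency table of the label variable.
counts = {"male": 1081, "female": 1066}

total = sum(counts.values())
lengths = [len(label) for label in counts]

# Mean length is the count-weighted average of the string lengths.
mean_length = sum(len(label) * c for label, c in counts.items()) / total

print(total, min(lengths), max(lengths), round(mean_length, 9))
```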
Value  Count  Frequency (%)
Lowercase_Letter 5 100.0%
 
Value  Count  Frequency (%)
Latin 5 100.0%
 
Value  Count  Frequency (%)
ASCII 5 100.0%
 

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) measures the linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, so for points lying exactly on a line the steepness of the line does not affect r; only the direction of the slope does.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
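
The calculation described above can be sketched in a few lines of plain Python. This is an illustrative implementation of the textbook formula, not the code the report generator uses; the function name and the sample data are made up for the example.

```python
import math

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x) / n)
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y) / n)
    return cov / (std_x * std_y)

# Hypothetical, nearly linear sample data.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]

r = pearson_r(x, y)
# Shifting and rescaling each variable (with positive factors) leaves r unchanged.
r_shifted = pearson_r([10 * a + 3 for a in x], [0.5 * b - 7 for b in y])
print(round(r, 6), round(r_shifted, 6))
```

The two printed values agree, which illustrates the invariance under changes in location and scale.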

Spearman's ρ

Spearman's rank correlation coefficient (ρ) measures the monotonic correlation between two variables and is therefore better than Pearson's r at catching nonlinear monotonic relationships. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
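
A minimal sketch of this rank-based calculation follows. Giving tied values the average of their ranks is a common convention and an assumption here; the helper names and the cubic sample data are illustrative only.

```python
import math

def average_ranks(values):
    """Rank values from 1..n, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson_r(x, y):
    """Covariance divided by the product of the standard deviations."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def spearman_rho(x, y):
    """Pearson's r applied to the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))

# A monotonic but nonlinear relationship (hypothetical data).
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
print(spearman_rho(x, y), pearson_r(x, y))
```

For this data ρ reaches 1 while r stays below 1, because the relationship is monotonic but not linear.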

Kendall's τ

Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation.

To calculate τ for two variables X and Y, one counts the concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
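
The pair-counting definition above corresponds to the variant usually called τ-a. The sketch below implements exactly that definition; it is not necessarily what the report computes, since statistical libraries often default to the tie-adjusted τ-b. The function name and sample data are illustrative.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    concordant = discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # The sign of the product tells whether x and y order the pair the same way.
        s = (x[i] - x[j]) * (y[i] - y[j])
        if s > 0:
            concordant += 1
        elif s < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)

# Hypothetical data: one of the six pairs is ordered differently in y.
x = [1, 2, 3, 4]
y = [1, 3, 2, 4]
tau = kendall_tau_a(x, y)
print(tau)
```

Here five pairs are concordant and one is discordant, so τ = (5 - 1) / 6.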

Missing values

Sample

First rows

index  meanfreq  sd        median    Q25       Q75       IQR       skew      kurt       sp.ent    sfm       mode      centroid  meanfun   minfun    maxfun    meandom   mindom    maxdom    dfrange   modindx   label
0      0.151228  0.072111  0.158011  0.096582  0.207955  0.111374  1.232831  4.177296   0.963322  0.727232  0.083878  0.151228  0.088965  0.017798  0.250000  0.201497  0.007812  0.562500  0.554688  0.247119  male
1      0.135120  0.079146  0.124656  0.078720  0.206045  0.127325  1.101174  4.333713   0.971955  0.783568  0.104261  0.135120  0.106398  0.016931  0.266667  0.712812  0.007812  5.484375  5.476562  0.208274  male
2      0.132786  0.079557  0.119090  0.067958  0.209592  0.141634  1.932562  8.308895   0.963181  0.738307  0.112555  0.132786  0.110132  0.017112  0.253968  0.298222  0.007812  2.726562  2.718750  0.125160  male
3      0.150762  0.074463  0.160106  0.092899  0.205718  0.112819  1.530643  5.987498   0.967573  0.762638  0.086197  0.150762  0.105945  0.026230  0.266667  0.479620  0.007812  5.312500  5.304688  0.123992  male
4      0.142239  0.078018  0.138587  0.088206  0.208587  0.120381  1.099746  4.070284   0.970723  0.770992  0.219103  0.142239  0.096729  0.017957  0.250000  0.336476  0.007812  2.164062  2.156250  0.148272  male
5      0.134329  0.080350  0.121451  0.075580  0.201957  0.126377  1.190368  4.787310   0.975246  0.804505  0.011699  0.134329  0.105881  0.019300  0.262295  0.340365  0.015625  4.695312  4.679688  0.089920  male
6      0.138551  0.077054  0.127527  0.087314  0.202739  0.115426  1.626770  6.291365   0.966004  0.752042  0.012101  0.138551  0.104199  0.019139  0.262295  0.246094  0.007812  2.718750  2.710938  0.132351  male
7      0.190846  0.065790  0.207951  0.132280  0.244357  0.112076  1.562304  7.834350   0.938546  0.538810  0.050129  0.190846  0.113323  0.017544  0.275862  1.434115  0.007812  6.320312  6.312500  0.254780  male
8      0.168346  0.074121  0.145618  0.115756  0.239824  0.124068  2.704335  18.484703  0.934523  0.559742  0.060033  0.168346  0.083484  0.015717  0.231884  0.146563  0.007812  3.125000  3.117188  0.059537  male
9      0.181015  0.074369  0.169299  0.128673  0.254175  0.125502  2.587325  12.281432  0.915284  0.475317  0.059957  0.181015  0.098643  0.016145  0.275862  0.209844  0.007812  3.695312  3.687500  0.059940  male

Last rows

index  meanfreq  sd        median    Q25       Q75       IQR       skew      kurt       sp.ent    sfm       mode      centroid  meanfun   minfun    maxfun    meandom   mindom    maxdom    dfrange   modindx   label
2137   0.178573  0.046679  0.164388  0.149309  0.204601  0.055293  3.066668  15.684088  0.891448  0.321169  0.153032  0.178573  0.155380  0.025478  0.253968  0.637921  0.148438  6.148438  6.000000  0.101291  female
2138   0.201806  0.036057  0.201622  0.178165  0.227872  0.049707  1.585353  4.945634   0.884731  0.227903  0.176117  0.201806  0.191704  0.032720  0.275862  0.593750  0.007812  5.921875  5.914062  0.124383  female
2139   0.183667  0.040607  0.182534  0.156480  0.207646  0.051166  2.054138  7.483019   0.898138  0.313925  0.177040  0.183667  0.149237  0.018648  0.262295  0.550312  0.007812  3.421875  3.414062  0.166503  female
2140   0.168794  0.085842  0.188980  0.095558  0.240229  0.144671  1.462248  5.077956   0.956201  0.706861  0.184442  0.168794  0.182863  0.020699  0.271186  0.988281  0.007812  5.882812  5.875000  0.268617  female
2141   0.151771  0.089147  0.185970  0.058159  0.230199  0.172040  1.227710  4.304354   0.962045  0.744590  0.230547  0.151771  0.201600  0.023426  0.266667  0.766741  0.007812  4.007812  4.000000  0.192220  female
2142   0.146023  0.092525  0.183434  0.041747  0.224337  0.182590  1.384981  5.118927   0.948999  0.659825  0.215482  0.146023  0.195640  0.039506  0.275862  0.533854  0.007812  2.992188  2.984375  0.258924  female
2143   0.131884  0.084734  0.153707  0.049285  0.201144  0.151859  1.762129  6.630383   0.962934  0.763182  0.200836  0.131884  0.182790  0.083770  0.262295  0.832899  0.007812  4.210938  4.203125  0.161929  female
2144   0.142056  0.095798  0.183731  0.033424  0.224360  0.190936  1.876502  6.604509   0.946854  0.654196  0.008006  0.142056  0.209918  0.039506  0.275862  0.494271  0.007812  2.937500  2.929688  0.194759  female
2145   0.143659  0.090628  0.184976  0.043508  0.219943  0.176435  1.591065  5.388298   0.950436  0.675470  0.212202  0.143659  0.172375  0.034483  0.250000  0.791360  0.007812  3.593750  3.585938  0.311002  female
2146   0.165509  0.092884  0.183044  0.070072  0.250827  0.180756  1.705029  5.769115   0.938829  0.601529  0.267702  0.165509  0.185607  0.062257  0.271186  0.227022  0.007812  0.554688  0.546875  0.350000  female